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Abstract 

Voting is a general method for preference aggrega- 
tion in multiagent settings, but seminal results have 
shown that all (nondictatorial) voting protocols are 
manipulable. One could try to avoid manipula- 
tion by using voting protocols where determining 
a beneficial manipulation is hard computationally. 
A number of recent papers study the complexity 
of manipulating existing protocols. This paper is 
the first work to take the next step of designing 
new protocols that are especially hard to manip- 
ulate. Rather than designing these new protocols 
from scratch, we instead show how to tweak ex- 
isting protocols to make manipulation hard, while 
leaving much of the original nature of the protocol 
intact. The tweak studied consists of adding one 
elimination preround to the election. Surprisingly, 
this extremely simple and universal tweak makes 
typical protocols hard to manipulate! The proto- 
cols become NP-hard, #P-hard, or PSPACE-hard 
to manipulate, depending on whether the sched- 
ule of the preround is determined before the votes 
are collected, after the votes are collected, or the 
scheduling and the vote collecting are interleaved, 
respectively. We prove general sufficient condi- 
tions on the protocols for this tweak to introduce 
the hardness, and show that the most common vot- 
ing protocols satisfy those conditions. These are 
the first results in voting settings where manipula- 
tion is in a higher complexity class than NP (pre- 
suming PSPACE ^ NP). 

1 Introduction 

Often, a group of agents has to make a common deci- 
sion, yet they have different preferences about which 
decision is made. Thus, it is of central importance to 
be able to aggregate the preferences, that is, to make 
a socially desirable decision as to which candidate is 
chosen from a set of candidates. Such candidates could 

*The material in this paper is based upon work supported by the 
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be potential presidents, joint plans, allocations of goods 
or resources, etc. Voting is the most general preference 
aggregation scheme, and has been used in several multiagent 
decision makin g problems in AI, such as collaborative 
filtering (e.g. [Pennock ef a/., 2000|) and planning among 
automated agents (e.g. pphrati and Rosenschein, 1991 1 
[Ephrati and Rosenschein, 1993| ). 

A key problem voting mechanisms are confronted with is 
that of manipulation by the voters. An agent is said to vote 
strategically when it does not rank the alternatives accord- 
ing to its true preferences, but differently so as to manipulate 
the outcome to be more favorable to the agent. For exam- 
ple, if an agent prefers Nader to Gore to Bush, but knows that 
Nader has too few other supporters to win, while Gore and 
Bush are close to each other, the agent would be better off by 
declaring Gore as its top candidate. Manipulation is an unde- 
sirable phenomenon. For one, because social choice schemes 
are tailored to aggregate preferences in a socially desirable 
way, and if the agents reveal their preferences insincerely, a 
socially undesirable candidate may be chosen. 

A seminal negative result, the Gibbard-Satterthwaite the- 
orem, shows that if there are three or more candidates, 
then in any nondictatorial voting scheme, there are pref- 
erences under which an agent is better off voting strategi- 
cally iGibbard, 1973'' Satterthwaite, 1975; !. (A voting scheme 
is called dictatorial if one of the voters dictates the social 
choice no matter how the others vote). In automated group 
decision making where the voters are software agents, the 
manipulability of protocols is even more problematic, for at 
least two reasons. First, the algorithms they use to decide how 
to vote must be coded explicitly. Given that the voting algo- 
rithm needs to be designed only once (by an expert), and can 
be copied to large numbers of agents (even ones represent- 
ing unsophisticated human voters), it is likely that rational 
strategic voting will increasingly become an issue, unmud- 
died by irrationality, emotions, etc. Second, software agents 
have more computational power and are more likely to find 
effective manipulations. 

We take the following tack toward avoiding manipulation: 
ensuring that finding a beneficial manipulation is so hard 
computationally that it is unlikely that voters will be able to 
manipulate. So, unlike in most of computer science, here 
high computational complexity is a desirable property. The 
harder it is to manipulate, the better. 



Prior work on the complexity of manipulating elections 
has focused on existing protocols iBartholdi ef a/., 19891 
partholdi and Orlin, 1991 ; Conitzer and Sandholm, 2002a|. 
This paper is the first to take the next step of designing 
new protocols that are especially hard to manipulate. Rather 
than designing these protocols from scratch, we show how to 
tweak existing protocols to make manipulation computation- 
ally much more difficult, while leaving much of the original 
nature of the protocol intact, for the following reasons: 

• Results on the computational complexity induced by a 
tweak typically apply to a large family of protocols. 

• Many of the original protocol's nice theoretical proper- 
ties are preserved by the tweak. 

• In practice, it will be much easier to replace a currently 
used protocol with a tweaked version of it, than with an 
altogether new protocol. 

The type of tweak we study in this paper is the follow- 
ing. All the candidates are paired in a prewund; of each pair 
of candidates, only the winner of their pairwise election sur- 
vives. (The winner of the pairwise election between two can- 
didates is the candidate that is ranked above the other more 
often in the votes.) After the preround, the original protocol 
is executed on the remaining candidates. The schedule of the 
preround (i.e., who faces who) can be determined before the 
votes are collected; after the votes are collected; or while the 
votes are collected (the processes are interleaved). We study 
these three cases in Sections|4]|5l and|6l respectively. ' 

2 Definitions 

2.1 Elections and voting protocols 

An election consists of a set of candidates C; a set of voters 
V; and a protocol for deciding on a winner w E C given all 
the voters' votes. (Here, a vote is a total ordering of the candi- 
dates.) A deterministic protocol is a function from the set of 
all combinations of votes to C. (All our results hold even for 
unweighted voters, so in this paper this function will always 
treat the voters symmetrically.) A randomized protocol is a 
function from the set of all combinations of votes to proba- 
bility distributions over C. An interleaved protocol is a pro- 
cedure for alternating between collecting parts of the voters' 
votes (e.g. whether they prefer candidate a to candidate b) 
and drawing and publishing random variables (such as parts 
of the schedule for an election), together with a function from 
the set of all combinations of votes and random variables to 

'The high complexity results obtained in this paper are depen- 
dent on the number of candidates growing. This places them in 
line with all the early results in this area iBartholdi ef a/., 1989. 
IBartholdi a nd Orlin, 1991 1, but in contrast with more recent re- 
sults [Conitzer and Sandholm, 2002a I that show high complexity of 
manipulation with a constant number of candidates for some pro- 
tocols. Having high complexity of manipulation occur with a con- 
stant number of candidates already is certainly preferable to having 
it occur only with a growing number. On the other hand, unlike in 
that paper, the results here hold even when the voters all have equal 
weight, and even when manipulation is attempted by an individual 
rather than a coalition, making the results in this paper stronger in 
that sense, so there is a tradeoff. 



C. (Collecting only parts of the voters' preferences is also 
known as elicitation.) 

The standard definitions of most voting protocols allow for 
the possibility of ties between candidates, in which case a tie- 
breaking rule is required to fully specify the protocol. All our 
results hold for any tie-breaking rule, so we do not need to 
specify such rules here. 

In this paper we apply our technique to the most common 
voting protocols (in these definitions, whenever points are de- 
fined, the candidate with the most points wins): 

• Plurality. A candidate receives 1 point for every voter 
that ranks it first. (Thus, the voters effectively only vote 
for one candidate.) 

• Borda. For each voter, a candidate receives m — 1 points 
if it is the voter's top choice, to — 2 if it is the second 
choice, if it is the last. 

• Maximin. A candidate's score in a pairwise election is 
the number of voters that prefer it over the opponent. A 
candidate's number of points is the lowest score it gets 
in any pairwise election. 

• Single Transferable Vote (STV). The winner determina- 
tion process proceeds in rounds. In each round, a candi- 
date's score is the number of voters that rank it highest 
among the remaining candidates, and the candidate with 
the lowest score drops out. The last remaining candidate 
wins. (A vote '"transfers" from its top remaining can- 
didate to the next highest remaining candidate when the 
former drops out.) 

2.2 Preround 

The tweaks we study in this paper all involve the addition of 
a preround. We will now define how this works. 

Deflnition 1 Given a protocol P, the new protocol obtained 
by adding a preround to it proceeds as follows: 

1. The candidates are paired. If there is an odd number of 
candidates, one candidate gets a bye. 

2. In each pairing of two candidates, the candidate losing 
the pairwise election between the two is eliminated. A 
candidate with a bye is never eliminated. 

3. On the remaining candidates, P is executed to produce 
a winner For this, the implicit votes over the remaining 
candidates are used. (For example, if a voter voted a >- 
b )^ c )^ d >- e, and b and c were eliminated, the voter's 
implicit vote is a )^ d >- e.) 

The pairing of the candidates is also known as the schedule 
for the preround. If the schedule is decided and published be- 
fore the votes are collected, we have a deterministic preround 
(DP RE), and the resulting protocol is called DP RE + P. If 
the schedule is drawn completely randomly after the votes are 
collected, we have a randomized preround (RPRE), and the 
resulting protocol is called RPRE + P. Finally, if the votes 
are elicited incrementally, and this elicitation process is in- 
terleaved with the scheduling-and-publishing process ( which 
is again done randomly), as described in detail in Section^ 
we have an interleaved preround (I PRE), and the resulting 
protocol is called I PRE + P. 



2.3 Manipulation 

We now define the computational problem of manipulation. 
Because all our hardness results hold even when the voters 
are unweighted, only a single voter is trying to manipulate, 
and all the other voters' votes are known to the manipulator, 
we will only define this simple setting here. Any hardness 
results in this simple setting immediately imply hardness in 
all more general settings. 

Definition 2 (CONSTRUCTIVE-MANIPULATION) 

We are given a protocol P, a candidate set C, a preferred 
candidate p, and a set of votes S corresponding to all the 
other voters' votes. The manipulator has yet to decide on 
its vote, and wants to make p win. Then the constructive 
manipulation question is: 

• (For deterministic protocols) Can the manipulator cast 
its vote to make p win under P? 

• (For randomized protocols) Can the manipulator cast its 
vote to make the probability of p winning under P at 
least some given fee [0,1].^ 

• (For interleaved protocols) Given the initial random 
choices (if any) by the protocol, is there a contingency 
plan (based on the random decisions the protocol takes 
between eliciting parts of the votes) for the manipula- 
tor to answer the queries to make the probability of p 
winning under P at least some given k G [0,1].^ 

3 Complexity of manipulating untweaked 
protocols 

In this section, we briefly review the complexity of 
manipulating voting protocols, as a benchmark for our 
results. CONSTRUCTIVE-MANIPULATION is in P 
for the Plurality, Borda, and Maximin voting proto- 
cols iBartholdi ef g/., 19 891. The only voting protocol for 
which CONSTRUCTIVE-MANIP ULATION is known to be 
NP-hard is the STV protocol I Bartholdi and Orlin, 1991j .^ 

4 NP-hardness when scheduling precedes 
voting 

In this section, we examine the complexity induced by the 
preround when the voters know the schedule before they vote. 

4.1 A sufficient condition for NP-hardness 

We present a sufficient condition under which adding a pre- 
round with a preannounced schedule makes manipulation 
NP-hard. The condition can be thought of as an NP-hardness 
reduction template. If it is possible to reduce an arbitrary SAT 
instance to a set of votes satisfying certain properties under 
the given voting protocol, that protocol — with a preround — 
is NP-hard to manipulate. 

Theorem 1 Given a voting protocol P, suppose that it is pos- 
sible, for any Boolean formula (j) in conjunctive normal form 

^CONSTRUCTIVE-MANIPULATION is N P-hard als o for the 
Second Order Copeland protocol I Bartholdi et al., 1989| , but the 
hardness is driven solely by the tie-breaking rule. 



(i.e., a SAT instance), to construct in polynomial time a set 
of votes over a candidate set containing at least {p} U Cl 
where Cl = {c/ : I € L} (L is the set of literals {+« : v S 
V} U w : V g V}, where V is the set of variables used in 
(j)), with the following properties: 

• (Property la) If we remove, for each v Q V, one of 
c+v and C-v, p would win an election under protocol P 
against the remaining candidates if and only if for every 
clause k G K (where K is the set of clauses in (j)), there 
is some I S L such that ci has not been removed, and I 
occurs in k. This should hold even if a single arbitrary 
vote is added. 

• (Property lb) For any v £ V, and C-y are tied in 
their pairwise election after these votes. 

Then CONSTRUCTIVE-MANIPULATION in DP RE + P is 
NP-hard ( and NP-complete if P is deterministic and can be 
executed in polynomial time). 

Proof: Consider the following election under DPRE + P. 
Let the candidate set be the set of all candidates occurring 
in the votes constructed from </> (the "original candidates"), 
plus one dummy candidate for each of the original candidates 
besides those in Cl- To each of the constructed votes, add 
all the dummy candidates at the bottom; let the resulting set 
of votes be the set of the nonmanipulators' votes. A single 
manipulator's vote is yet to be added. Let the schedule for 
the preround be as follows: for each v, c+„ and c_„ face 
each other in the preround; and every other original candidate 
faces (and, because of the dummy candidates' position in the 
votes, defeats) a dummy candidate. Thus, the set of candi- 
dates that make it through the preround consists of, for each 
V G V, one of c+„ and c_^,; and all the other original candi- 
dates. The manipulator's vote will decide the winner of every 
c+t, vs. c_i, match-up, because by property lb, all these pair- 
wise elections are currently tied. Moreover, it is easy to see 
that the manipulator can decide the winner of each of these 
match-ups independently of how it decides the winners of the 
other match-ups. Thus, we can think of this as the manipu- 
lator giving the variables truth-values: v is set to true if c+v 
survives, and to false if c_„ survives. By property la it then 
follows that p wins if and only if the manipulator's assign- 
ment satisfies all the clauses, i.e. is a solution to the SAT 
instance. Hence there is a successful constructive manipu- 
lation if and only if there is a solution to the SAT instance, 
and it follows that CONSTRUCTIVE-MANIPULATION in 
DPRE + Pis NP-hard. (It is also in NP if P is deterministic 
and can be executed in polynomial time, because in this case, 
given a vote for the manipulator, it can be verified in polyno- 
mial time whether this vote makes p win). ■ 

4.2 Examples 

We now show how to apply Theorem [l] to the well-known 
protocols we discussed, thus showing that each of these 
protocols — with a preround — is NP-hard to manipulate. 

Theorem 2 There exists a reduction that satisfies properties 
la and lb of Theorem\l\under the Plurality protocol. 



When it does not matter for our proofs whether a given vote 
is a y b y c or b )^ a )~ c, we write {a, b} >- c. 

Proof: Given the formula 0, let the candidate set be the min- 
imally required candidates {p} U Cl, plus a set of candidates 
corresponding to the set of clauses K of (j), Ck — {ck '■ k E 
K}. Then, let the set of votes be as follows: 4:\K\ + 2 votes 
ranking the candidates p >- Cl >~ Ck', for each k E K , A\K\ 
votes ranking the candidates Ck >- {cd £ Ck cl ^ k} )~ 
Cl >- P\ and for each k E K, A votes ranking the candidates 
{ci E Cl ■■ I E k} ^ Ck ^ {ci € Cl ■■ I i k} ^ {c^i <E 
Ck : cl k} y p. Additionally, we require that these votes 
are such that after counting them, for each v E V, c+t, and 
C-v are tied in their pairwise election, so that property lb is 
satisfied. (This is possible because the total number of votes 
is even, and the majority of the votes do not yet have any re- 
strictions on the order of the Cl-) We now show property la 
is satisfied. We first observe that regardless of which of the 
candidates corresponding to literals are removed, p will get 
4|i^| + 2 votes. Now, if for some k E K, all the candidates 
Cl with I G L,l £ fc are removed, then Ck will get at least 
A\K\ +4 votes and p will not win. On the other hand, if for 
each k G K, at least one candidate ci with I G k remains, 
then each of the Ck will get precisely A\K\ votes. Because 
each remaining c; can get at most A\K\ votes as well, p will 
win. In both cases there is a "margin" of at least 2, so a sin- 
gle additional vote will not change this. Thus, property la is 
satisfied. ■ 

Theorem 3 There exists a reduction that satisfies properties 
la and lb ofTheorem^imder the Borda protocol. 

Proof: Given the formula 0, let the candidate set be the min- 
imally required candidates {p} U Cl', plus a set of candi- 
dates corresponding to the set of clauses K of 0, Ck = 
{ck : k G K}, which we order in some arbitrary way 
to get {ci, . . . , C|^|}. Let AI be the total number of can- 
didates this defines. Then, let the set of votes be as fol- 
lows: for every Ci G Ck, 4Ajf votes ranking the candidates 

Cj+i >- Cj+2 > ■■■ ^ C\K\ > P > Cl C2 > ■ ■ ■ ^ 

Ci-i >- {ci E Cl ■ I E Ci] >- Ci >- {ci E L : I ^ a]; 
(here, the slight abuse of notation I E Ci means that / oc- 
curs in the clause corresponding to c^;) AM votes ranking the 
candidates ci > C2 >- ... >- c\k\ ^ P ^ Cl', one vote 
Cl C2 ^ . ■ ■ y- c\K\ >- Cl >~ P', one vote c\k\ >- c\k\-i ^ 
. . . )~ Cl >- Cl >- p', and finally, A\K\M votes ranking the 
candidates p y ci y C2 >~ ■ ■ ■ >~ Cn >- Cl, and 41/^11/ 
votes ranking the candidates c„ >- c„_i >- . . . ci > p > Cl- 
Additionally, we require that these votes are such that after 
counting them, for each v eV, c+„ and c_t, are tied in their 
pairwise election, so that property lb is satisfied. (This is 
possible because the total number of votes is even, and the 
majority of the votes do not yet have any restrictions on the 
order of the c;.) We now show property la is satisfied. It is 
easy to see that none of the c; can win, regardless of which of 
them are removed. Thus, we only need to consider the Ci and 
p. The last Sj/^lAf votes will have no net effect on the rela- 
tive scores of these candidates, so we need not consider these 
here. After the first 4(1/^1 + \)M votes, any Ck for which 
all the Cl with I E k have been removed will be tied with p. 



and any other Cfe will be at least AM points behind p. Finally, 
from the last remaining two votes, any Ck {k E K) will gain 
2M — 2|y I — jiiT I — 1 points on p. It follows that p wins if 
and only if for every clause k E K, there is some I E L with 
/ G k such that ci has not been removed. In both cases there 
is a "margin" of at least Af — points, so a single additional 
vote will not change this. Thus, property la is satisfied. ■ 

Theorem 4 There exists a reduction that satisfies properties 
la and lb of Theorem^under the Maximin protocol. 

Proof: Given the formula (p, let the candidate set be the min- 
imally required candidates {p} U Cl, plus a set of candidates 
corresponding to the set of clauses K of (p, Ck — {ck ■ 
k E K}. Then, let the set of votes be as follows: 8\K\ votes 
ranking the candidates p >- Cl >- Ck, 8\K\ votes rank- 
ing the candidates Cl >~ Ck >- P, and 8\K\ votes ranking 
the candidates Ck ^ p >~ Cl', ^\K\ votes ranking the can- 
didates Cl y p >■ Ck, M^l votes ranking the candidates 
Ck >- Cl >- p, and, for each k E K, A votes ranking the 
candidates p >~ {cd E Ck '■ cl ^ k} >~ {ci E Cl '■ I E k} y- 
Ck >- {ci E Cl '■ I ^ k}; and finally, 2 votes ranking the can- 
didates p >- Ck y Cl, and 2 votes ranking the candidates 
Ck ^ p >~ Cl- Additionally, we require that these votes 
are such that after counting them, for each v E V, c+y and 
C-v are tied in their pairwise election, so that property lb is 
satisfied. (This is possible because the total number of votes 
is even, and the majority of the votes do not yet have any 
restrictions on the order of the c;.) We now show property 
la is satisfied. Regardless of which of the candidates corre- 
sponding to literals are removed, p's worst score in a pairwise 
election is against any of the Ck, namely 16|iir| + 2. Any Ck 
for which all the ci with / G k have been removed will get its 
worst pairwise election score against any of the Cl, namely 
16|ii'| + 4. Finally, any other Ck will get its worst pairwise 
election score against one of the q with I E k, namely, 16|i^ |. 
It follows that p wins if and only if for every clause k E K, 
there is some I E k such that c; has not been removed. In both 
cases there is a "margin" of at least 2, so a single additional 
vote will not change this. Thus, property la is satisfied. ■ 

Theorem 5 There exists a reduction that satisfies properties 
la and lb ofTheorem^under the STV protocol. 

Proof: We omit the proof due to limited space. ■ 

Theorem 6 In any of DPRE + Plurality, DPRE + 
Borda, DPRE + Maximin, and DPRE + STV^ , 
CONSTRUCTIVE-MANIPULATION is NP-complete. 

Proof: NP-hardness is immediate from the previous theo- 
rems. The problem is in NP because these protocols can be 

^^The NP-completeness of manipulating DPRE + STV is, in 
itself, not that interesting, because STV is already NP-hard to ma- 
nipulate without the preround as we discussed. Nevertheless, our 
method highlights a different aspect of the NP-hardness of manip- 
ulating DPRE + STV. We build on this reduction later to prove 
PSPACE-hardness of manipulating STV with a preround when the 
scheduling of the preround is interleaved with the vote elicitation. 



executed in polynomial time. ■ 

In the next sections, we will raise the bar and bring 
the problem of manipulating elections to higher complexity 
classes by abandoning the assumption that the schedule for 
the preround should be known in advance. 

5 #P-hardness when voting precedes 
scheduling 

In this section, we will examine the complexity induced by 
the preround when the schedule is drawn completely (uni- 
formly) randomly after all the votes have been collected. 

5.1 A sufficient condition for #P-hardness 

We present a sufficient condition for a voting protocol to be- 
come #P-hard"^ to manipulate in this setting. Again, this con- 
dition can be thought of as a reduction template. If it is possi- 
ble to reduce an arbitrary PERMANENT instance to a set of 
votes satisfying certain properties under the given voting pro- 
tocol, that protocol is #P-hard to manipulate when a random- 
ized preround is added to it. (In the PERMANENT problem, 
we are given a bipartite graph B with the same number of 
vertices k in both parts, and are asked how many matchings 
there are. This problem is #P-complete [Valiant, 19 791.) 

Theorem 7 Given a voting protocol P, suppose that it is pos- 
sible, for any bipartite graph B with the same number of ver- 
tices k in both parts (labeled 1 to k in one part, k -\- I to 2k 
in the other), to construct in polynomial time a set of votes 
over the candidate set {ci , . . . , C2k , p} ( where Ci corresponds 
to vertex i in B) with the following properties: 

• (Property 2a) If we remove k of the Ci, p would win an 
election under protocol P against the remaining Ci if and 
only if the removed Ci are exactly all the Ci with /c + 1 < 

i < 2k; 

• (Property 2b) p loses its pairwise election against all Ci 
with k + l<i<2k; 

• (Property 2c) For any 1 < i < k and k + 1 < j < 2k, Ci 
defeats Cj in their pairwise election if and only if in B, 
there is an edge between vertices i and j. 

• (Property 2d) All the previous properties still hold with 
any additional single vote. 

Then CONSTRUCTIVE-MANIPULATION in RPRE + P is 
#P-hard. 

Proof: Given the set of votes constructed on the basis of an ar- 
bitrary B, let us compute the probability that p wins under the 
protocol RPRE + P with only these votes. In the preround, 
there are k matches and one bye. By property 2a, p will win 
the election if and only if the k candidates eliminated in this 
preround are precisely all the c; with /c + 1 < z < 2k. By 
property 2b, p could not win a preround match against any of 
these, so p will win the election if and only if it gets the bye, 
and each of the Cj with fc+1 < j < 2k faces one of the q with 
1 < i < k that defeats it in the preround. Then, by property 

is the class of problems where the task is to count the number 
of solutions to a problem in NP. 



2c, it follows that p wins if and only if the preround pairing 
corresponds to a matching in B. Thus the probability of p 
winning is e(2fc™ fc+i) ' where is the number of matchings 
in B and e(2fc, 2A; + 1) is the number of different ways to pair 
2fc of the 2fc + 1 candidates in the preround (which is straight- 
forward to compute). Thus, evaluating p's chances of win- 
ning in this election is at least as hard as counting the num- 
ber of matchings in an arbitrary B, which is #P-hard. More- 
over, because we can compute p's chances of winning solely 
on the basis of properties 2a, 2b, and 2c, and by property 
2d, these properties are maintained for any single additional 
vote, it follows that a manipulator cannot affect p's chances 
of winning. Thus, CONSTRUCTIVE-MANIPULATION in 
this case simply comes down to computing p's chances of 
winning, which is #P-hard as demonstrated. ■ 

5.2 A broadly applicable reduction 

In this subsection we present a single broadly applicable re- 
duction which will satisfy the precondistions of Theorem 
for many voting protocols, including all of the protocols dis- 
cussed in this paper, thus proving them #P-hard to manipulate 
when the voting precedes the preround scheduling. 

Definitions We label the following reduction Ri. Given a 
bipartite graph B with the same number of vertices k in both 
parts (labeled 1 to k in one part, k-\-\ to 2k in the other), we 
construct the following set of\2k^ + 2k^ votes: 

• Qk^ votes that rank the candidates Ck+i )- Ck+2 >■■■>■ 

C2k p ci C2 y . . . y Ck; 

• 3fc^ votes that rank the candidates p y Ck >- Ck-i >- 
. . . >~ ci >~ C2k >~ C2k-i Ck+i; 

• 6k^ — 3fc^ votes that rank the candidates Ck >- Ck-i >- 
■ . . >- ci >~ C2k >■ C2k-i >-...)- Ck+1 >- p; 

• For each edge (i, j) in B (1 < i < k, k + I < j < 2k), 
one vote that ranks the candidates Ci )~ Cj )~ p )^ ci >- 

C2>- .. ■>- Ci-i >- Ci+i y . . . y Ck y Ck+1 >- Ck+2 >- 

. . . Cj-i >~ Cj+i )^ . . . C2k, and another one that 
ranks them C2k >- C2k-i >~ ■ ■ ■ Cj+i >- cj^i ^ . . . >- 

Ck+1 y Ck y Ck-i y . . . y Ci+i >~ Ci-i >~ . . . >~ ci >- 

p )^ Ci >- Cj (i.e., the inverse of the former vote, apart 
from Ci and Cj which have maintained their order); 

• For each pair i,j without an edge between them in B 
(l<i<k, k + \<j< 2k), one vote that ranks the 
candidates Cj )~ Ci >~ p >~ ci )~ C2 )~ ■ ■ ■ >- Ci_i >~ 

Ci+l >-...>- Ck >- Ck+l >~ Ck+2 y . ■ ■ y- Cj^i >- 

Cj+i y . . . >- C2k, ond another one that ranks them 

C2k >- C2k-i y ■ ■ . y- Cj+i y Cj-i y . . . y Ck+i y 
Ck y cfc_i y ... y q+i y q_i y ... y ci y p y 
Cj y Ci (i.e., the inverse of the former vote, apart from 
Cj and Ci which have maintained their order). 

We now have to show that this reduction satisfies the pre- 
conditions of Theorem^ We start with the properties that are 
protocol-independent. 

Theorems Ri satisfies properties 2b and 2c of Theorem^ 
( under any protocol P, because these properties are indepen- 
dent of P), even with a single additional arbitrary vote. 



Proof: In the pairwise election between p and any one of the 
Ci with k + 1 < i < 2k, p is ranked higher in only 4k'^ 
votes, and thus loses the pairwise election. So property 2b 
is satisfied. For a pairwise election between some Ci and cj 
(1 < i < k and k + 1 < j < 2k), the first 12P votes' 
net contribution to the outcome in this pairwise election is 
0. Additionally, the two votes associated with any pair q, r 
(I < q < k and A: + 1 < r < 2k) also have a net contribution 
of 0, if either q ^ i or r j. The only remaining votes are 
the two associated with the pair so Ci wins the pairwise 
election by 2 votes if there is an edge in B, and Cj wins 
the pairwise election by 2 votes otherwise. So property 2c 
is satisfied. Because both are satisfied with a "margin" of at 
least 2, a single additional vote will not change this. ■ 

Finally, because property 2a is protocol-dependent, we 
need to prove it for our reduction on a per-protocol basis. This 
is what the following four theorems achieve. 

5.3 Examples 

Theorem 9 Ri satisfies property 2a ofTheorem^under the 
Plurality protocol. This holds even when there is a single 
additional arbitrary vote. 

Proof: If at least one of the Ci with k + l<i<2k\s, not 
removed, p can get at most 5fc^ votes, whereas the lowest- 
indexed remaining candidate among the Ci with fc + 1 < i < 
2k will get at least 6fc^ votes, so p does not win. On the other 
hand, if all the q with k + l < i <2k are removed, p will get 
at least Qk^ + 3fc^ votes, which is more than half the votes, 
so p wins. In both cases there is a "margin" of at least 2, so a 
single additional vote will not change this. ■ 

Theorem 10 i?i satisfies property 2a ofTheorem^under the 
Borda protocol. This holds even when there is a single addi- 
tional arbitrary vote. 

Proof: If at least one of the Ci with fc + 1 < « < 2fc is not 
removed, consider the highest-indexed remaining candidate 
among the Ci with fc + 1 < i < 2fc; call it h. The first 12fc'^ 
votes will put h at least %k^ — 3fc^ points ahead of p. {\2k:^ — 
3fc^ of them rank h above p, and the Sfc'^ others can give p an 
advantage of at most fc each.) The 2fc^ remaining votes can 
contribute an advantage to p of at most fc each, and it follows 
that h will still have at least 7k^ — 3fc^ more points than p. So 
p does not win. On the other hand, if all the q with fc + 1 < 
I < 2fc are removed, then there are two groups of Qk^ — 
3fc^ among the first 12fc'^ votes which (over the remaining 
candidates) are each other's exact inverses and hence have 
no net effect on the scores. Also, the last 2fc^ votes, which 
are organized in pairs, have no net effect on the score because 
(over the remaining candidates) the votes in each pair are each 
other's exact inverse. The remaining votes all rank p highest 
among the remaining candidates, so p wins. In both cases the 
"margin" is big enough that a single additional vote will not 
change this. ■ 

Theorem 11 i?i satisfies property 2a ofTheorem\7\under the 
Maximin protocol. This holds even when there is a single 
additional arbitrary vote. 



Proof: If at least one of the with fc + 1 < i < 2k is 
not removed, then in any pairwise election between such a 
candidate and p, p will get at most 5fc^ votes. However, the 
lowest-indexed remaining candidate among the Ci with fc + 
1 < i < 2fc will get at least 6k^ votes in every one of its 
pairwise elections. So p does not win. On the other hand, if 
all the Ci with fc + l<i<2fcare removed, p will get at least 
6fc^ + 3fc^ votes in every one of its pairwise elections, which 
is more than half the votes; so p wins. In both cases there is 
a "margin" of at least 2, so a single additional vote will not 
change this. ■ 

Theorem 12 Ri satisfies property 2a ofTheorem^under the 
STV protocol. This holds even when there is a single addi- 
tional arbitrary vote. 

Proof: If at least one of the with fc + 1 < i < 2k is 
not removed, consider the lowest-indexed remaining candi- 
date among the Ci with fc + l<i<2fc; call ill. I will hold at 
least Gfc"^ votes as long as it is not eliminated, and p can hold 
at most 5fc^ votes as long as / is not eliminated. It follows that 
p will be eliminated before I, so p does not win. On the other 
hand, if all the the c; with fc + l<i<2fcare removed, p will 
hold at least 6fc'^ + 3fc^ votes throughout, which is more than 
half the votes; so p cannot be eliminated and wins. In both 
cases there is a "margin" of at least 2, so a single additional 
vote will not change this. ■ 

Theorem 13 In any of RPRE + Plurality, RPRE + 
Borda, RPRE + Maximin, and RPRE + STV, 
CONSTRUCTIVE-MANIPULATION is #P-hard. 

Proof: Immediate from the previous theorems. ■ 

6 PSPACE-hardness when scheduling and 
voting are interleaved 

In this section, we increase the complexity of manipulation 
one more notch, to PSPACE-hardness,^ by interleaving the 
scheduling and vote elicitation processes. 

We first discuss the precise method of interleaving required 
for our result. The method is detailed and quite complicated. 
Nevertheless, this does not mean that the interleaving should 
always take place in this particular way in order to have the 
desired hardness. If the interleaving method used for a partic- 
ular election is (say, randomly) chosen from a wider (and pos- 
sibly more naturally expressed) class of interleaving methods 
containing this one, our hardness result still goes through, as 
hardness carries over from the specific to the general. Thus, 
our goal is to find the most specific method of interleaving 
for which the hardness still occurs, because this gives us the 
most information about more general methods. We only de- 
fine the method for the case where the number of candidates 
is a multiple of 4 because this is the case that we will reduce 
to (so it does not matter how we generalize the protocol to 
cases where the number of candidates is not a multiple of 4). 

Definition 4 I PRE proceeds as follows: 

^PSPACE is the class of problems solvable in polynomial space. 



1. Label the matchups (a matchup is a space in the pre- 
round in which two candidates can face each other; at 
this point they do not yet have candidates assigned to 
them) 1 through 

2. For each matchup i, assign one of the candidates to play 
in it, and denote this candidate by c(i, 1). Thus, one of 
the candidates in each matchup is known. 

3. For some k which is a multiple of 4, for each i with 1 < 
i < k, assign the second candidate to play in matchup i, 
and denote this candidate c{i, 2). Thus, we have k fully 
scheduled matchups. 

4. For each pair of matchups (2i — 1 , 2i) with i > ^, assign 
two more candidates to face the candidates already in 
these two matchups, and denote them c((2i — 1, 2i), 1) 
and c((2i — 1, 2i), 2). (Thus, at this point, all that still 
needs to be scheduled is, for each i, which of these two 
faces c{2i — 1, 1) and which c(2i, 

5. Fori^^ + \to 

• Randomly decide which of c{{2i—l, 2i), 1) and c{{2i — 
1, 2i), 2) faces c(2i — 1, 1), andwhichfaces c(2i, 1). De- 
note the former c{2i — 1, 2), the latter c{2i, 2), 

• Ask all the voters whether they prefer c{i — ^, 1) or 
c{i — |, 2). (We observe that, even if the number of al- 
ready scheduled matchups is k — 0, the elicitation pro- 
cess trails behind the scheduling process by a factor 2.) 

6. Elicit the remainder of all the votes. 

One important property of this elicitation process is that 
the voters are treated symmetrically: when a query is made, 
it is made to all of the voters in parallel. Thus, no voter 
gets an unfair advantage with regard to knowledge about 
the schedule. Another important property is that the elici- 
tation and scheduling process at no point depends on how 
the voters have answered earlier queries. Thus, voters cannot 
make inferences about what other voters replied to previous 
queries on the basis of the current query or the current knowl- 
edge about the schedule. These two properties guarantee that 
many issu es of strategic voting that may occur with vote elic- 
itation ^Conitzer and Sandholm, 2002b | in fact do not occur 
here. 

We are now ready to present our result. 

Theorem 14 Given a voting protocol P, suppose that it is 
possible, for any Boolean formula (p in conjunctive normal 
form (i.e., a SAT instance) over variables V — X \JY with 
\X\ = \Y\ (and corresponding literals L), to construct in 
polynomial time a set of votes over a candidate set containing 
at least {p}UCL^{Cy : y € Y} with the following properties: 

• (Property 3a) If we remove, for each v d V, one of 
c+y and c^y, p would win an election under protocol P 
against the remaining candidates if and only if for every 
clause k G K (where K is the set of clauses in (p), there 
is some I £ L such that ci has not been removed, and I 
occurs in k. This should hold even if a single arbitrary 
vote is added. 

• (Property 3b) For any x G X, and c_x are tied in 
their pairwise election after these votes. 



• (Property 3c) For any y £Y, Cy and c^y are both losing 
their pairwise elections against Cy by at least 2 votes ( so 
that they will lose them regardless of a single additional 
vote). 

Then CONSTRUCTIVE-MANIPULATION in IPRE + P is 
PSPACE-hard (and PSPACE-complete if P can be executed 
in polynomial space). 

Proof: Consider the following election under /PJSi^+P. Let 
the candidate set be the set of all candidates occurring in the 
votes constructed from (j) (the "original candidates"), plus one 
dummy candidate for each of the original candidates besides 
the c+„ and c^y. To each of the constructed votes, add all the 
dummy candidates at the bottom; let the resulting set of votes 
be the set of the nonmanipulators' votes, according to which 
they will answer the queries posed to them. The manipulator 
has yet to decide on its strategy for answering queries. Af- 
ter step 4 (according to Definition |3 of IPRE + P (up to 
which point the manipulator will not have had to make any 
decisions), let the situation be as follows: 

• The number of already fully scheduled matchups is fc = 

— 2|F|. In matchup i{l < i < \X\), c+x, faces c^xf 
In the remaining fully scheduled matchups, candidates 
not corresponding to a literal face a dummy candidate. 

• Matchups fc + 2i - 1 and /c + 2i (1 < « < |F|) already 
have candidates c+y. and c-y. in them, respectively. The 
other two candidates to be assigned to these rounds are 
Cy. and a dummy candidate. 

Thus, what will happen from this point on is the following. 
For i ranging from 1 to \X\, first the protocol will schedule 
which of c+j,; and c_y. face which of Cy. and the dummy 
candidate. The c/ facing the dummy will move on, and the 
other will be defeated by Cy. , by property 3c. Second, every- 
one will be asked which of c^^i and c_j;. is preferred, and 
because the nonmanipulators will leave this pairwise election 
tied by property 3b, the manipulator's vote will be decisive. 
Thus, we can think of this as nature and the manipulator alter- 
natingly giving the variables in Y and X respectively truth- 
values: V is set to true if c+y survives, and to false if c_„ 
survives. By property 3a it then follows that p wins if and 
only if the resulting assignment satisfies all the clauses, i.e. is 
a solution to the SAT instance. Thus, the manipulator's strat- 
egy for setting variables should aim to maximize the chance 
of the SAT instance being satisfied eventually. But this is 
exactly the problem STOCHASTIC-SAT, which is PSPACE- 
complete I Papadimitriou, 19851. 

If P can be executed in polynomial space, the manipulator 
can enumerate all possible outcomes for all possible strategies 
in polynomial space, so the problem is also in PSPACE. ■ 

Because the preconditions of Theorem [H] are similar to 
those of Theorem^ we can build on our previous reductions 
to apply this theorem to the well-known protocols. 

Theorem 15 For each of Plurality, Borda, Maximin, 
and STV, there exists a reduction that satisfies properties 3a, 
3b and 3c ofTheorem \14\ Thus, In any of IPRE + Plurality, 
IPRE + Borda, IPRE + Maximin, and IPRE + STV, 
CONSTRUCTIVE-MANIPULATION is PSPACE-complete. 



Sketch of Proof: We can modify the reductions from Sec- 
tion 13 to satisfy the preconditions of Theorem This is 
done by adding in the Cy in such a way as to achieve property 
3c (ranking them just above their corresponding Cy and c_j, 
in sHghtly more than half the votes), while preserving prop- 
erty 3a (by ranking them as low as possible elsewhere). ■ 

7 Conclusions 

Voting is a general method for preference aggregation in mul- 
tiagent systems, but seminal results have shown that all (non- 
dictatorial) voting protocols are manipulable. One could try 
to avoid manipulation by using voting protocols where deter- 
mining a beneficial manipulation is hard computationally. A 
number of recent papers study the complexity of manipulat- 
ing existing protocols. 

This paper is the first work to take the next step of design- 
ing new protocols that are especially difficult to manipulate. 
Rather than designing these new protocols from scratch, we 
instead showed how to tweak existing protocols to make ma- 
nipulation hard, while leaving much of the original nature 
of the protocol intact. The tweak studied in this paper con- 
sists of adding one preround to the election, where candi- 
dates face each other one against one. The surviving can- 
didates continue to the original protocol. Surprisingly, this 
extremely simple and universal tweak makes typical proto- 
cols hard to manipulate! The protocols become NP-hard, #P- 
hard, or PSPACE-hard to manipulate, depending on whether 
the schedule of the preround is determined before the votes 
are collected, after the votes are collected, or the schedul- 
ing and the vote collecting are interleaved, respectively. We 
proved general sufficient conditions on the protocols for this 
tweak to introduce the hardness, and showed that the most 
common voting protocols satisfy those conditions. These are 
the first results in voting settings where manipulation is in a 
higher complexity class than NP (presuming PSPACE 7^ NP). 
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